Supporting Systolic and Memory Communication in iWarp

نویسندگان

  • Shekhar Borkar
  • Robert Cohn
  • George Cox
  • Thomas Gross
  • H. T. Kung
  • Monica Lam
  • Margie Levine
  • Brian Moore
  • Craig Peterson
  • Jim Susman
  • Jim Sutton
  • John Urbanski
  • Jon Webb
چکیده

TM general computing; whereas systolic communication is iWarp is a parallel architecture developed jointly by efficient and well suited for speed critical applications. Carnegie Mellon University and Intel Corporation. The iWarp communication system supports two widely used interprocessor communication styles: memory A major achievement of the iWarp effort is the derivation of communication and systolic communication. This paper a common design to satisfy the requirements of both systolic describes the rationale, architecture, and implementation for and memory communication styles. This is made possible by the iWarp communication system. two important innovations in communication: (1) program access to communication and (2) logical channels. The former allows programs to access data as they are transmitted The sending or receiving processor of a message can perform and to redirect portions of messages to different destinations either memory or systolic communication. In memory efficiently. The latter increases the connectivity between the communication, the entire message is buffered in the local processors and guarantees communication bandwidth for memory of the processor before it is transmitted or after it is classes of messages. These innovations have provided a received. Therefore communication begins or terminates at focus for the iWarp architecture. The result is a the local memory. For conventional message passing communication system that provides a total bandwidth of 320 methods, both sending and receiving processors use memory MBytes/sec and that is integrated on a single VLSI communication. In systolic communication, individual data component with a 20 MFLOPS plus 20 MIPS long items are transferred as they are produced, or are used as they instruction word computation engine. are received, by the program running at the processor. Memory communication is flexible and well suited for iWarp is a trademark of Intel Corp. Warp is a servicemark of Carnegie Mellon University. This research was supported in part by the Defense Advanced Research Projects Agency, Information Science and Technology Office, under the title " Research on Parallel Computing", ARPA Order No. 7330, issued by DARPA/CMO under Contract MDA972-90-C-0035. Authors’ affiliations: R. Cohn, T. Gross, H. T. Kung, and J. Webb are with Carnegie Mellon University; S. Borkar, G. Cox, M. Levine, B. Moore, W. Moore, C. Peterson, J. Susman, J. Sutton, and J. Urbanski are with Intel; M. Lam, who was a Ph.D. student at Carnegie Mellon University, is now with Computer Systems Laboratory, Stanford University, Stanford, CA 94305. An earlier version of this paper appears in Proceedings of the 17th Annual International Symposium on Computer Architecture, Seattle, Washington, May 1990, pp. 70 81. operates at a frequency of 20 MHz, with the exception that the

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Impact of Communication Style on Machine Resource Usage for the iWarp Parallel Processor

Programs executing on a private-memory parallel system exchange data by explicitly sending and receiving messages. Two communication styles have been identified for such systems: memory communication (each message exchanged between two processors is buffered in memory, e.g. as in message passing) and systolic communication (each word of a message is transmitted directly from the sender processo...

متن کامل

iWarp: An Integrated Solution to High-Speed Parallel Computing

An iWarp component connected to a local memory forms Abstract an iWarp cell; up to 64 MBytes of memory are directly addressable. A large array of iWarp cells will deliver an iWarp is a system architecture for high speed signal, image enormous computing bandwidth never before realized in disand scientific computing. The heart of an iWarp system is the iWarp component: a single chip processor tha...

متن کامل

Utilizing New Communication Features in Compiliation for Private-Memory Machines

The communication system of some 3rd generation private-memory machines provides long-lived connections (which reserve communication resources like buffers between nodes) as well as direct access by the computation unit(s) of the node to the communication system. These features allow a compiler to find innovative solutions when compiling data-parallel programs for a private-memory machine. In t...

متن کامل

Data Reshuuing in Support of Fast I/o for Distributed-memory Machines

Achieving high-speed network I/O on distributed-memory systems is a hard problem because their ar-chitectures are, in general, ill-suited for communication processing. One of the problems is that messages are distributed over the private memories of the distributed-memory system. This can result in poor performance since communication includes a complex scatter/gather operation. This paper pres...

متن کامل

Enabling Applications for RDMA: Distributed Compilation Revisited

Remote Direct Memory Access (RDMA) traditionally aimed at improving high-performance computing applications and storage-area networks because CPU cycles and memory bus load can be reduced significantly by applying the zero-copy and direct data-placement techniques offered by RDMA. With the advent of iWARP, the RDMA technology is now available to the ubiquitous TCP/IP infrastructure and, thus, b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1990